Mobile Intelligent Agents for Document Classi cation andRetrieval : A Machine Learning
نویسندگان
چکیده
This paper describes an implementation of intelligent, customizable mobile software agents for document classiication and retrieval. The mobile agents are implemented using the Voyager platform. The agents learn user's interests by interacting with the user. Results of experiments using three diier-ent approaches { TFIDF, Bayesian and DistAl (neural network classiier) { for the design of trainable document classiiers are presented. The performance of each classiier with and without feature subset selection (using genetic algorithms) was explored. Experiments with retrieval of journal paper abstracts and news articles demonstrate the feasibility of using machine learning to design mobile intelligent agents for customized information retrieval.
منابع مشابه
Machine Learning for Intelligent Processing of Printed Documents FLORIANA ESPOSITO , DONATO MALERBA AND FRANCESCA A
A paper document processing system is an information system component which transforms information on printed or handwritten documents into a computer-revisable form. In intelligent systems for paper document processing this information capture process is based on knowledge of the speci c layout and logical structures of the documents. This article proposes the application of machine learning t...
متن کاملClassi cation of Scienti c Papers Using Machine Learning Minh
The project aims to develop a domain-independent and adaptive approach for scienti c document classi cation using both information from document contents and citation links. We evaluate several content-based classi cation methods including K-nearest neighbours, nearest centroid, naive Bayes and decision trees and nd that the naive Bayes outperform other when training set is sufciently large. Us...
متن کاملAutomatic Discovery of DocumentClassi cation Knowledge
We investigate approaches for automatic discovery of document classi cation knowledge from text databases. We review existing rule-based text classi cation learning algorithms such as SWAP-1 and RIPPER. After identifying their weakness, we propose a new technique known as the IBRI algorithm by unifying the strengths of rule-based learning and instance-based approaches and adapting to characteri...
متن کاملText Classification for Intelligent Agent Portfolio Management
In the application domain of stock portfolio management, software agents that evaluate the risks associated with the individual companies of a portfolio should be able to read electronic news articles that are written to give investors an indication of the nancial outlook of a company. There is a positive correlation between news reports on a company's nancial outlook and the company's attracti...
متن کاملIntelligent Feature Selection for Opinion Classification
References 1. T. Macer, M. Pearson, and F. Sebastiani, “Cracking the Code: What Customers Say, in their own Words,” Proc. 50th Ann. Conf. Market Research Soc. (MRS 07), MRS, 2007. 2. D. Giorgetti and F. Sebastiani, “Automating Survey Coding by Multiclass Text Categorization Techniques,” J. Am. Soc. Information Science and Technology, vol. 54, no. 14, 2003, pp. 1269–1277. 3. G. Forman, “Quantify...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998